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DIAGNOSTI C iMETHOD 

This invention relates to novel sequence and polymorphisms in the human llt-l gene. 
The invention also relates to methods and materials for analysing allelic variation in the llt-l 
5 gene and to the use of tit- 1 polymorphism in the diagnosis and treatment of angiogenic 
diseases and cancer. Diseases associated with pathological angiogenesis include diabetic 
retinopathies, psoriasis, rheumatoid arthritis and endometriosis. 

l'lt-1 is one of the two receptors for vascular endothelial growth factor (VEGFR-1). 
The other being KDR (VEGFR-2). The l:lt-l protein consists of an external domain containing 
10 seven immunoglobulin like domains, a transmembrane region and a cytoplasmic region 
containing a tyrosine kinase domain. In contrast to other members of the receptor tyrosine 
kinase family, the kinase domain offlt-1 is in two segments with an intervening sequence of 
--70 amino acids, fhe biology of the VEGF receptors has been reviewed (Neufeld et ah, 
(1999) FASBB Journah 13: 1 1-22; Zachary ( 1998) Experimental Nephrology. 6:480-487) and 
15 the tyrosine phosphorylation sites have been identified (rto et al., (1998) J. Biol. Cliem. 
273:23410-23418). 

It is thought that Ht- l may be important in regulating the tissue architecture in 
developing vasculature while the second VEGF receptor (KDR. VEGFR-2)mediates the 
mitogenic and angiogenic eftects of VEGF in eildothelial cells. Evidence to support this 

20 theory has come from knockout studies in mice (Fong et al.. (1995) Nature. 376:66-70). 

VEGF and its receptors are over expressed in many tumour types and blocking of 
VEGF ftmction inhibits angiogenesis and suppresses growth of tumours while over expression 
of VEGF enhances angiogenesis and tumour growth (Skobe et ah, ( 1997y^^ 
3:1222-1227). Several studies have now shown tliat modulation of fit- 1 activity can lead to 

25 anti-tumour activity, A small mpleoule inhibitor, SU5416. was originally developed against 
KDR but has been shown to be active against tlt-h; the authors propose that inhibition of tit- 1 
may lead to interference with the formation of endothelial-matrix interactions (Fong et ah, 
(1999) Cancer Research. 59:99-106). 

Alternative strategies to modulate fit- 1 activity have included the use of ribozymes 

30 (Parry et ah, (1999) Nucleic Acids Research. 27:2569-2577), the synthesis of aptamers to 
inhibit binding of VEGF to its receptor (Ruckman et ah, (1 998) J Biol Ghem. 273:20556- 
20567) and the m v/va tmnsfer of the flt-1 external domain (Kong et ah. ( 1 998) Fiuman Gen^ 
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Therapy. 9:823-833). Chimeric toxins containing VEGF fused to the diptheria toxin have ^ 

been used to target endothelial cells (Arora et ah, (1 999) Cancer Research. 59: 1 83-1 88). 

The nt- 1 cDNA (EMBL Accession Number X5 1 602, 7680 bp) encodes a mature 

protein of 1338 amino acids. The structure of the murme tlt-1 gene has been determined 
5 (Ivondo ciL. (1998) Gene 208:297-305) and has been used to predict the intron/exon 

boundaries within the human gene. The promoter region of the human gene has been 

characterised (Ikeda el al, (1996) Growth Factors. 13: 15 I -1 62; Morishitac^/ a/,, (1995) J Biol 

Chem 270:27948-27953; EMBL Accession Number 064016,1745 bp). The tlt-1 gene, which 

is organised into thirty exons, has been localised to chromosome 13ql2 (Rosnet et al. (1993) 
10 Oncogene 8:73-179). 

Unless otlierwisc indicated or apparent from the conlexl, all exon positions herem 

relate to the positions indicated in EMBL Accession X5 1602, all promoter positions relate to 

the positions indicated in EiVfBL Accession No. 64016, and all intron sequences relate to one 

or other of SEO ID Nos 1 - 5 disclosed herein. 
15 SEQ ID No. 1 (1073 bp) represents exon 17 (positions 483 - 615 corresponding to 

positions 2605-2737 in EMBL Accession No. X51602) and adjacent intron sequences 

(positions 1-482 and 6 16- 1073). 

SECl ID No. 2 (1480 bp) represents exon 21 (positions 438 - 594 corresponding to 

positions 3046-3202 in EMBL Accession No. X5 1602), exon 22 (positions 1025- 1 122 
20 corresponding to positions 3203-3300 in EMBL Accession No. X5 1602) and intron sequences 

adjacent these exons (positions 1-437, 595-1024 and 1 i23-1480). 

SEQ ID No. 3 (726 bp) represents exon 24 (positions 267-278 corresponding to 

positions 3424-3535 in EMBL Accession No. X5 1602) and adjacent intron sequences 

(positions 1-266 and 279-726). 
25; SEQ ID No. 4 (1352 bp) represents exon 26 (positions 285-390 corresponding to 

positions 3636-3741 in EMBL Accession No. X5 1602), exon 27 (positions 652-794 

corresponding to positions 3742-3884 in EMBL Accession No/ X5 1602) and intron sequences 

adjacent these exons (positions 1-284, 391-651 and 795-1 352). 

SEQ ID No. 5 (1256 bp) represents exon 28 (positions 580-664 
30 positions 3885-3969 in EMBL Accession No. X51 602) and adjacent intron sequences 

(positions l-579and 665-1256). 
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The novel introii sequence, or parts thereof, can be used, inter alia, as hybridisation 
probes to identify clones harbouring the Ht-l gene, for use in genetic linkage studies or for 
design and use as amplification primers suitable, for example, to amplify some or all of the 
tit- 1 gene using an amplification reaction such as the PCR. 
5 Polymorphism refers to the occurrence of two or more genetically determined 

alternative alleles or sequences within a population. A polymorphic marker is the site at which 
divergence occurs. Preferably markers have at least two alleles, each occurring at Irequenc}- 
of greater than 1%, and more preferably at least 10%, 15%, 20%, 30% or more of a selected 
population. 

i 0 Single nucleotide polyniorphisnis (SNP) are generally, as the name implies, single 

nucleotide or point variations that exist in the nucleic acid sequence of some members of a 
species. Such polymorphism variation within the species are generally regarded to be the 
result of spontaneous mutation throughout evolution. The mutated and normaf sequences co- 
exist within the species' population sometimes in a stable or quasi-stable equilibrium. At 

15 other times the mutation may confer some selective advantage to the species and with time 
niay be incorporated into the genomes of al l members of the species. 

Some SNPs occur in the protein coding sequences, in vvhich case, one of the 
polymorphic protein forms may possess a different amino acid vyhich may give rise to the 

J- 

expression of a variant protein and, potentially, a genetic disease. Recently, it has been 
20 reported that even polymorphisms that do not result in an amino acid change can cause 

different structural folds of mRNA vvith potentially different biological functions (Shen et aL, 
(1999) Proc Natl Acad Sci USA 96:7871-7876). Changes that occur outside of the coding 
region, i.e. intron sequences, promoter regions etc may affect the transcription and/or message 
stability of the sequences and thus aftect the level of the protein (receptor) in cells. 
25 The use of knowledge of polymorphisms to help identify patients most suited to 

therapy with particular pharmaceutical agents is often termed ''pharmacogenetics". 
Pharmacogenetics can also be used in pharmaceutical research to assist the drug selection 
process. Polymorphisms are used in mapping the human genome and to elucidate the genetic 
component of diseases. The reader is directed to the following references for background 
30 details on pharmacogenetics and other uses of polymorphism detection: Linder et al. (1997), 
Clinical Chemistry, 43:254; Marshall (1 997), Nature Biotechnology, 15: 1249; International 
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Patent Application WO 97/40462, Spectra Biomedical; and Schafer a/, (1998), Nature ' 
Biotechnology, 16:33. 

A haplotype is a set of alleles found at linked polyniorphic sites (such as within a 
y^ene) on a single (paternal or maternal) chromosome. If recombination within the gene is 
5 random, there may be as many as T haplotypes, where 2 is the number of alleles at each SNP 
and n is the number of SNPs. One approach to identifying mutations or polymorphisms 
which are correlated with clinical response is to carry out an association study using all the 
haplotypes that can be identified in the population of interest. The frequency of each 
haplotype is hmited by the frequency of its rarest allele, so that SNPs with low frequency 

10 al leles are particularly useful as markers of low frequency haplotypes. As particular 
mutations or polymorphisms associated with certain clinical features, such as adverse or 
abnormal events, are likely to be of low frequency within the population, low frequency SNPs 
niay be particularly useful in identifying these mutations (for examples see: Linkage 
disequilibrium at the cystathionine beta synthase (CBS) locus and the association between 

15 genetic variation at the CBS locus and plasma levels of homocysteine; /l/'m Hum Genet (1998) 
62:48 1-90, De Stefano V, Dekou V, Nicaud V, Chasse JF, London J, Stansbie D, Humphries 
SL, and Gudnason V; and Variation at the von willebrand factor (vWF) gene locus is 
associated with plasma vWF:Ag levels: identification of three novel single nucleotide 
polymorphisms in the vVIf gene promoter. Blood (1 999) 93:4277-83, Keightley AM, Lam 

20 YM, Brady JN, Cameron CL, Lillicrap D). 

Clinical trials have shown that patient response to drugs is heterogeneous. Thus there 
is a need tor improved approaches to pharmaceutical agent design and therapy. 

Point mutations in polypeptides will be referred to as follows: natural amino acid 
(using 1 or 3 letter nomenclature) , position, new amino acid. For (a hypothetical) example, 

25 "D25K" or ''Asp25Lys" means that at position 25 an aspartic acid (D) has been changed to 
lysine (K). Multiple mutations in one polypeptide will be shown between square brackets 
with individual mutations separated by commas. 

The present invention is based on the discovery of nine novel single nucleotide 
polymorphisms as well as novel intronic sequence of the tit- 1 gene. Relative to EMBL 

30 Accession No. X5 1602 the three novel coding sequence polymorphisms are located at 

nucleotide position: 1953, 3453 and 3888. Relative to EMBL Accession No. D640 16 the four 
novel promoter sequence polymorphisms are located at nucleotide position: 519, 786, 1422 ., 
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and 1429. Relative to SEQ ID No. 3, the intron 24 polymorphism is located at position 454 
Relative to SEQ ID No.5, the intron 28 polymorphism is located at position 696. 

For the avoidance of doubt the location ol'each of the polymorphisms (emboldened 
published allele (if published) illustrated first) and sequence immediately flanking each 
5 polymorphism site is as follows: 

Numbering according to EMBL Accession X51602 



a) Position 1953 (codon 568 polymorphism) 
GGAAAAAATGCCGACG/AGAAGGAGAGGACCTG (SEQ ID No. 6) 

10 1938 ' l%8 

b) Position 3453 (codon 1068 polymorphism) 
G.'VAATGGATGGCTCCC/TGAATCTATCTTTGAC (SEQ ID No. 7) 

3438 3468 

15 ^. 

c) Position 3888 (codon 1213 polymorphism) 
TGATGATGTCAGATAT/CGTAAATGCTTTCAAG (SEQ ID No. 8) 

3873 ■ ^903 

20 Numbering according to EMBL Accession D640 16 

d) Position 519 (promoter polymorphism) 
AAAAAGAGACGGACAG/TGCTCGCCTGGGACCT (SEQ ID No. 9) 

504 : 534 

25 

e) Position 786 (promoter polymorphism) 
GATCGGACTTTGCGCC/TGCTAGGGCCAGGCGG (SEQ ID No. 10) 

771 801 



30 0 
1407 



Position 1422 (promoter polymorphism) 
GACGGACTCTGGCGGC/TCGGGTGTTTGGCCGC 

1437 



(SEQ ID No. 1 1) 



00-004 ■ 



-6- 



g) Position 1429 (promoter polymorphism) 
TCTGGCGGCCGGGTCG/TTTGGCCGCGGGGAGC (SEQ ID No. 12)- 

1414 1434 

5 

Numbering according to Seq ID 3 (intron 24) 

h) Intron 24 position 454 
GAATGTCCTTTGGTTG/AGACAGCCTTTAGATT : (SEQ ID No. 1 3) 

!() 439 469 

Nun^bering according to Seq ID No 5 (intron 28) 

i) Intron 28 position 696 

15 AGGTACCTAGTGCACT/GCCGATAGACCGCTTG (SEQ ID No. 14) 

681 711 

According to one aspect of the present invention tiiere is provided a method for the 
diagnosis of one or more single nucleotide polymorphism(s) in ft- 1 gene in a human, which 

20 method comprises determining the sequence o f the nucleic acid of the human at one or more 
of positions: 1953, 3453, 3888 (each according to the position in EMBL accession number 
X51602), 519, 786, 1422, 1429 (each according to the position in EMBL accession number 
D64016), 454 (according to SEQ ID No. 3) and 696 (according to SEQ ID No. 5), and 
determining the status of the human by reference to polymorphism in the tit- 1 gene. 

25 The term human includes both a human having or suspected of having a fit- 1 

ligand-mediated disease and an asymptomatic human who may be tested for predisposition or 
susceptibility to such disease. At each position the human may be homozygous for an allele 
or the human may be a heterozygote. 

As defined herein, the flt-1 gene includes exon coding sequence, intron sequences 

30 intervening the exon sequences and, 3' and 5' untranslated region (3' UTR and 5' UTR) 
sequences, including the promoter element of the fit- 1 gene. 
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In one embodiment of the invention preferably the mcttuxl for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 1953 (according to the 
position in EMi3L accession number X5 1602) is the presence of G and/or A. 

In another embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism at position 3453 (according to the 
position in EMBL accession number X5 1602) is the presence of C and/or T. 

In another embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 3888 (according to the 
position in EMBL accession number X5 1602) is the presence of T and/or C: 
10 In another embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism at position 519 (according to the 
position in EMBL accession number D640 16) is the presence of C and/or T. 

In anodier embodiment of the invention preferably the niethod for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 786 (according to ihc 
I 5 position in EMBL accession number D640 16) is the presence of G and/or 

In another embodiment of the invention preferably the method for diagnosis described 
herein is one in vvhich the single nucleotide polymorphism at position 1422 (according to the 
position in EMBL accession number D6401 6) is the presence of G and/or T. 

In anothereiiibodiment of the invention preterably the method for diagnosis described 
20 herein is one in which the single nucleotide polymorphism at position 1429 (according to the 
position in EiVIBL accession number D6401 6) is the presence of G and/or T. 

hi another embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism at position 454 (according to the 
position in SEQ ID No. 3) is the presence of G arid/or A, 
25 In another embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism at position 696 (according to the 
position in SEQ ID No. 5) is the presence of T and/or C. 

The method for diagnosis is preferably one in which the sequence is determined by a 
method selected from amplification refractory mutation system (ARMS'^^-allele specific 
30 amplification), allele specific hybridisation (ASH), oligonucleotide ligation assay (OLA) and 
restriction fragment length polymorphism (RFLP); 
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In another aspect of the invention there is provided a method of analysing a nucleic 
acid, comprising: obtaining a nucleic acid from an individual; and determining the base 
occupymg any one of the following polymorphic sites; 1953, 3453, 3888 (each according to 
the position in EMBL accession number X51602), 519, 786, 1422, 1429 (each according to 
5 the position in EMBL accession number D64016), 454 (according to SEQ ID No. 3) and 696 
(according to SEQ ID No. 5). 

In another aspect of the invention we provide a method for the diagnosis of tit- 1 
ligand-mediated disease, which method comprises: 
i) obtaining sample nucleic acid from an individual; 

10 ii) detecting the presence or absence of a variant nucleotide at one or more of positions: I 953. 
3453, 3888 (each according to the position in EMBL accession number X5 1602), 519, 786. 
1422, 1429 (each according to the position in EMBL accession number D640 16), 454 
(according to SEQ ID No. 3) and 696 (according to SEQ ID No. 5), in the tit- 1 gene: and, 
iii) determining the status of the individual by reference to polymorphism in the lit- 1 gene, 

15 Allelic variation at position 1953 (according to EMBL sequence X5 J 602) consists of a 

single base substitution from G (the published base), for example to A. Allelic variation at 
position 3453 (according to EMBL sequence X5 1602) consists of a single base substitution 
from C (the published base), for example to J. Allelic variation at position 72 (according to 
EMBL sequence X5 1602) consists of a single base substitution from T (the published base), 

20 for example to C\ Allelic variation at position 5 19 (according to EMBL sequence D64016), 
consists of a single base substitution from C (the published base), for example to T. Allelic 
variation at position 786 (according to EMBL sequence D640 16), consists of a single base 
substitution from C (the published base), for example to T. Allelic variation at position 1422 
(according to EMBL sequence D640 16), consists of a single base substitution from C (the 

25 published base), for example to T. Allelic variation at position 1429 (according to EMBL 
sequence D64016), consists of a single base substitution from G (the published base), for 
example to T. Allelic variation at position 454 (according to SEQ ID No. 3) consists of a 
single base substitution from C to G, for example. Allelic variation at position 454 (according 
to SEQ ID No. 5) consists of a single base substitution from T to C, for example. 

30 The invention resides in the identification of the existence of different alleles at 

particular loci. The status of the individual may be determined by reference to allelic 
variadon^at one or more of the above loci. 
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The test sample of nucleic acid is conveniently a sample of blood, bronchoalveolar 
lavage tluid, sputum, urine or other body fluid or tissue obtained from an individual. It will 
be appreciated that the test sample may equally be a nucleic acid sequence corresponding to 
the sequence in the test sample, that is to say that ail or a part of the region in the sample 
5 nucleic acid may firstly be amplified using any convenient technique e.g. PGR, before use in 
the anah'sis of sequence variation. 

It will be apparent to the person skilled in the art that there are a large number of 
analytical procedures which may be used to detect the presence or absence of one or more of 
the polymorphisms identified herein, hi general, the detection of allelic variation requires a 

!(,) mutation discrimination lechnique, optionally an amplification. reaction and a signal 

generation system. Table 1 lists a number of mutation detection techniques, some based on 
the PGR. These may be used in combination with a number of signal generation systems, a 
selection of which is listed in 'fable 2. Further amplification techniques are listed in fable 3. 
iVIany current methods for the detection of allelic variation are reviewed by Nollau cv ciL. Clin. 

15 Chem. 43, 11 14-1 120, 1997; and in standard textbooks, for example "Laboratory Protocols 
for Mutation FJetection", Ed. by U. Landegren, Oxford University Press, 1996 and "PCR'\ 2'^^ 
l^dition by Newton & Graham, BIOS Scientific Publishers Limited, 1997. 



Abbreviations: | 



ALEXT-^' 


Amplifieation refractory mutation system linear extension 


APEX 


Arrayed primer extension 


ARMST-^' 


Amplification refractory mutation system 


ASH 


Allele specific hybridisation 


b-DNA 


Branched DNA 


CMC 


Chemical mismatch cleavage 


bp 


base pair 


COPS 


Competitive oligonucleotide priming system 


DGGE 


Denaturing gradient gel electrophoresis 


FRET 


Fluorescence resonance energy transfer 


LCR 


Ligase chain reaction 


MAS DA 


Multiple allele specific diagnostic assay 
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NASBA 


Nucleic acid sequence based amplification 


nt-1 


VEGF receptor- 1 


OLA 


Oligonucleotide ligation assay 


PGR 


Polymerase chain reaction 


PTT 


Protein truncation test 


RFLP . 


Restriction fragment length polyniorphtsm 


SERRS 


Surface enhanced ranaan resonance spectroscopy 


SDA 


Strand displacement amplification 


SNP 


Single nucleotide polymorphism 


SSCP 


Single-strand conformation polymorphism analysis 


SSR 


Self sustained replication 


TGGE 


Temperature gradient gel electrophoresis 



! able 1 ■ [Vlutation Detection Techniques 

General: DNA sequencing. Sequencing hy hybridisation 

Scanning: PTT*, SSCP, DGGE, TGGE;Cleavase. Heteroduplex analysis, CM^^ 
5 niismalch cleavage 

^'^ Note: not uselul tor detection of proaioter polymorphisms. 
Hybridisation Based 

Solid phase hybridisation: Dot blots, MAS DA, Reverse dot blots. Oligonucleotide 
arrays (DNA Chips) 

10 Solution phase hybridisation: Taqman™ - US-5210015 & US-5487972 (Hoftmann-La 

Roche), Molecular Beacons - Tyagi et cil (1996), Nature Biotechnology, 14, 303; WO 
95/13399 (Public Health Inst., New York), ASH 

Extension Based: ARMS™-allele specific amplification (as described in European patent 
No. EP-Bo32435 and US patent No. 5,595,890), ALEX™ - European Patent No. EP 332435 
15 81 (Zeneca Limited), COPS - Gibbs c-/ a/ (1989), Nucleic Acids Research, 17, 2347. 
Incorporation Based: Mini-sequencing, APEX 
Restriction Enzyme Based: RFLP, Restriction site generating PCR 
Ligation Based: OLA- Nickerson et aL (1990) P.N,A-S. 87:8923-8927. 
Other: Invader assay 



00-004 ' 

-11- 

Tahlc 2 - Sienal Generation or Detection Systems 

Fluorescence: FRET, Fluorescence quenching, Fluorescence polarisalion - United Kingdoin 
Patent No. 2228998 (Zeneca Limited) 
5 Other: Chemiluminescence, Electrocheniiluminescence, Raman, Radioactivity, Colorimetric, 
Kybridisation protection assay, Mass spectrometry, SERRS - WO 97/05280 (University of , 
Strathclyde). 

Table 3 - Further Amnlification Methods 

10 SSR, NASBA, LCR, SDA, b-DNA 

Preferred mutation detection techniques include ARMS'''''^'-allele specific 
amplification, ALEX™, COPS, Taqman, Molecular Beacons, RFLP, OLA, restriction site 
based PGR and FRET techniques. 

Particularly preferred methods include ARMS™-alIele specific amplification, OLA 

1 5 and RFLP based methods. The allele specific ampLification technique known in the art as 
ARFvIS'^"^' is an especially preferred method. 

ARMS™-allele specific amplification (described in European patent No. EP-B- 
332435, US patent No. 5,595,890 and Newton et al. (Nucleic Acids Research, Vol. 17, 
p. 2503; 1989)), relies on the complementarity of the 3' terminal nucleotide |3f the primer and 

20 its template. The 3 ' terminal nucleotide of the primer being either complementary or non- 
complementary to the specific mutation, allele or polymorphism to be detected. There is a 
selective advantage for primer extension from the primer whose 3' terminal nucleotide 
complements the base mutation, allele or polymorphism. Those primers which have a 3' 
terminal mismatch with the template sequence severely inhibit or prevent enzymatic primer 

25 extension. Polymerase chain reaction or unidirecfional primer extension reactions therefore 
result in product amplification when the 3' terminal nucleotide of the primer complements 
that of the template, but not, or at least not efficiently, when the 3' terminal nucleotide does 
not complement that of the template. 

Therapeutic opportunities for VEGF receptor antagonists exist for angiogenic and 

30 cancer diseases. An example of a knovvn inhibitor of fit- 1 is SU54 16 (57//7ra). 



f - ■ r 

00-004 V 



■12- 



In a further aspect, the diagnostic methods of the invention are used to assess the 
efficacy of therapeutic compounds in the treatment of angiogenic diseases, such as diabetic 
retinopathies, psoriasis, rheumatoid arthritis and endometriosis, and cancer. 

The polymorphisms identified in the present invention that occur in intron regions or 
5 in the promoter region are not expected to alter the amino acid sequence of the flt-1 receptor, 
but may alYect the transcription and/or message stabiHty of the sec]uences and thus aiTect the 
level of the receptors in cells. 

Assays, for example reporter-based assays, may be devised to detect whether one or 
more of the above polymorphisms affect transcription levels and/or message stability. 
10 Individuals who carry particular allelic variants of the ttt- 1 gene, especially those 

within the promoter element, may therefore exhibit differences in receptor levels under 
different physiological conditions and will display altered abilities to react to different 
, diseases. In addition, differences in receptor level arising as a result of allelic variation may 
have a direct effect on the response of an individual to drug therapy. Fit- 1 polymorphism may 
15 therefore have the greatest effect on the efficacy of drugs designed to modulate the activity of 
the lit- 1 . However, the polymorphisms may also affect the response to agents acting on other 
biochemical pathways regulated by a tlt-1 ligand. The diagnostic methods of the invention 
may therefore be useful both to predict the clinical response to such agents and to determine 
therapeutic|dose. 

20 In a further aspect, the diagnostic methods of the invention, aLre used to assess the 

predisposition and/or susceptibility of an individual to diseases mediated by an flt-l ligand. 

Fit- 1 gene polymorphism may be particularly relevant in the development of diseases 
modulated by an tlt-1 ligand. The present invention may be used to recognise individuals who 
are particularly at risk from developing these conditions. 

25 In a further aspect, the diagnostic methods of the invention are used in the 

development of new drug therapies which selectively target one or more allelic variants of the 
tlt-l gene. Identification of a link between a particular allelic variant and predisposition to 
disease development or response to drug therapy may have a significant impact on the design 
of new drugs. Drugs may be designed to regulate the biological activity of variants implicated 

30 in the disease process whilst minimising effects on other variants. 

In a further diagnostic aspect of the invention the presence or absence of variant 
nucleoticles is detected by reference to the loss or gain of optionally engineered, sites 
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recognised by restriction enzynrj:]. Tor o.^jmple :he yo\ymocplu:..\\ ... position 3888 cluu alters 
the third base of codon !213 can be detected by digestion with the restriction enzyme Sna 1 B. 
as polymorphism at this position creates a Sna IB recognition sequence (TACGTA). 

Engineered sites include those wherein the primer sequences employed to amplify the 
5 target sequence participates along with the nucleotide polymorphism to create a restriction site 
For example, the polymorphisn^ at position 519 can be detected by diagnostic engineered 
IIFLP digestion with the restriction enzyme Sph 1, since modification of position 516 creates 
a potential Sph 1 I recognition sequence (GCATGC). Polymorphism at position 519 will 
modify the recognition sequence (GCAC/TGC). 
fO The person of ordinary skiH will be able to design and implement diagnostic 

pi-occdures based on the detection of restriction fragment length polymorphism due U) the Ions 
or gain of one or more of the sites. ' 

According to another aspect of the present invention there is provided a nucleic acid 
comprisinganyoneofthe following polymorphisms: 
I 5 the nudeicjicid disclosed in EMBL Accession dumber X5 f602 with A at position 1953 
according to the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL Accession Number X5 1 602 with T at position 
3453 according to the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EiVlBL Accession Number X5 1602 with C at position 
20 3888 according to the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL Accession Number D64016 with T at position 
5 19 according to the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL Accession Number D64016 with T at position 
786 according to the nucleotide positioning therein; 
25 the nucleic acid sequence disclosed in EMBL Accession Number D64016 with T at position 
1422 according to the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL Accession Number D640 1 6 with T at position 
1429 according to tiie nucleotide positioning therein; 

the nucleic acid sequence disclosed in SEQ ID No. 3 with G at position 454 accordingjo the 
30 nucleotide positioning therein; 

the nucleic acid sequence disclosed in SEQ ID No, 3 with A at position 454 according, to the " ' 
nucleotkie positioning therein; 
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the nucleic acid sequence disclosed in SEQ ID No. 5 with T at position 696 ciccording to the 
nucleotide positioning therein; 

the nucleic acid sequence disclosed in SEQ ID No. 5 with C at position 696 according to the 

nucleotide positioning therein; 
5 or a complementary strand thereof or a fragment thereof of at least 17 bases comprising at 

least one of the polymorphisms- 
According to another aspect of the present invention there is provided an isolated 

nucleic acid comprising at least 17 consecutive bases of fit- 1 gene said nucleic acid 

comprising one or more of the following polymorphic alleles: A at position 1953 (according 
10 to X51602); T at position 3453 (according to X51602), C at position 3888 (according to 

X5 1602), T at position 519 (according to D64016)/T at position 786 (according to D64()16), 

T at position 1422 (according to D64016), T at position 1429 (according to D64016), A at 

position 454 (according to SEQ ID No. 3) and C at position 696 (according to SEQ ID No. 5) 

or a complementary strand thereof 
15 Fragments are at least 17 bases more preferably at least 20 bases, more preferably at 

least 30 bases. 

The invention further provides nucleotide primers which detect the tit- 1 gene 
polymorphisms o f the invention. Such primers can be of any length, for example between 8 
and 1 00 nucleotides in length, but will preferai)|3ly be between 12 and 50 nucleotides in length. 

20 more preferable between 17 and 30 nucleotides in length. 

According to another aspect of the present there is provided an allele specific primer 
capable of detecting an fit- 1 gene polymorphism at one or more of positions: 1953, 3453, 
3888 (each according to the position in EMBL accession number X5 1602), 519, 786, 1422, 
1429 (each according to the position in EiVIBL accession number D640 16), 454 (according to 

25 SEQ ID No. 3) and 696 (according to SEQ ID No. 5). 

An allele specific primer is used, generally together with a constant primer, in an 
amplification reaction such as PGR, which provides the discrimination between alleles 
through selective amplification of one allele at a, particular sequence position e.g. as used for 
ARMS'^^* allele specific amplification assays. The allele specific primer is preferably 17- 50 

30 nucleotides, more preferably about 17-35 nucleotides, more preferably about 17-30 
nucleotides. 
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An allele specific primer preferably corresponds exactly vvitli the allele to be detected 
hut derivatives thereof are also contemplated wherein about 6-(S of the nucleotides at the 3^ 
terminus correspond with the allele to be detected and wherein up to I 0, such as up to 8, 6, 4, 
2, or 1 of the remaining nucleotides may be varied without significantly affecting the 

5 properties of the primer. Often the nucleotide at the -2 and/or -3 position (relative to the 3' 
terminus) is mismatched in order to optimise differential primer binding and preferential 
extension from the correct allele discriminatory primer only 

Primers may be manufaetured using any convenient method of synthesis. Examples of 
such methods may be found in standard textbooks, for example "Protocols for 

0 Oligonucleotides and Analogues; Synthesis and Properties/' Methods in Molecular Biology 
Scries; Volume 20; f d. Sudhir Agravvai J-Iuniana ISFW: 0-89603-247--7; I9Q3; V' fdition. If 
required the primer(s) may be labelled to faeilitate detection. 

According to another aspect of the present invention there is provided an allele- 
specific oligonucleotide probe capable of detecting a tit- 1 gene polymorphism of the 

5 invention. 

According to another aspect of the present invention there is provided an allele- 
specific oligonucleotide probe capable of detecting an tlt-l gene polymorphism at one or more 
of positions: 1953; 3453, 3888 (each according to the position in EMBL accession number 
X5 1 602), 5 1 9, 786, 1 422, 1429 (each according to the position in EMBL accession number 
0 D64016), 454 (according to SEQ ID No. 3) and 696 (according to SEQ ID Ko. 5), in the tlt-l 
gene. 

The ailele-specific oligonucleotide probe is preferably 17- 50 nucleotides, more 
preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

The design of such probes will be apparent to the molecular biologist of ordinary skill. 

5 Such probes are of any convenient length such as up to 50 bases, up to 40 bases, more 
conveniently up to 30 bases in length, such as for example 8-25 or 8-15 bases in length. In 
general such probes will comprise base sequences entirely complementary to the 
corresponding wild type or variant locus in the gene. However, if required one or more 
mismatches may be introduced, provided that the discriminatory power of the oligonucleotide 

0 probe is not unduly affected. Suitable oligonucleotide probes might be those consisting of or 
conaprising the sequences depicted in SEQ ID Nos. 6 - 14 possessing one or other of the 
central allelic base differences (emboldened), or sequences complementary thereto. The 



00-004 

-16- 

probes or primers of the invention may can'y one or more labels to facilitate detection, such a'^' 
in iVIolecular Beacons. 

According to another aspect of the present invention there is provided a diagnostic kit 
comprising one or more allele-specific primers of the invention and/or one or more allele- 
5 specific oligonucleotide probe of the invention. 

llie diagnostic kits may comprise appropriate packaging and instructions for use in the 
methods of the invention. Such kits may further comprise appropriate buffer(s) and 
polynierase(s) such as thermostable polymerases, for example taq polymerase. Such kits ma\- 
also comprise companion primers and/or control primers or probes. A companion primer is 
I 0 one that is part of the pair of primers used to perform PCR. Such primer usually complements 
the template strand precisely. 

In another aspect of the invention, the single nucleotide polymorphisms of tiiis 
invention may be used as genetic markers for this region in linkage studies. This particularly 
applies to the polymorphisms at positions 3453, 3888 (both according to the position in 
15 LiiVIllh Accession No. X5 1602), position 1429 (according to the position in EMBL accession 
tiumber D64016), position 454 (according to the position in SEQ ID No. 3) and position 696 
(according to the position in SEQ ID No. 5) because of their relatively high frequency. Those 
polymorphisms that occur relatively infrequently are useful as markers of low frequency 
haplotypes. I 
20 According to another aspect of the present invention there is provided a method of 

treating a human in need of treatn^ient with an lit- 1 ligand antagonist drug in which the method 
comprises: 

i) diagnosis of a single nucleotide polymorphism in tit- 1 gene in the human, which 
diagnosis comprises determining the sequence of the nucleic acid at one or more of positions: 

25 1953, 3453, 3888 (each according to the position in EMBL accession number X5 1602), 519, 
786, 1422, 1429 (each according to the position in EMBL accession number D640 16), 454 
(according to SEQ ID No. 3) and 696 (according to SEQ ID No. 5), and determining the status 
of the human by reference to polymorphism in the flt-l gene; and 

ii) administering an effective amount of an tlt-l ligand antagonist drug. 

30 
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Preferably determination of the status of the human is clinically useful. Examples of 
clinical usefulness include deciding which flt-1 ligand antagonist drug or drugs to administer 
and/or in deciding on the effective amount of the drug or drugs. 

According to another aspect of the present invention there is provided use of an tit-l 
5 ligand antagonist drug in the preparation of a medicament for treating a VEGF-mediated 
disease in a human diagnosed as having a single nucleotide polyniorphism at one or more of 
positions: 1953, 3453, 3888 (each according to the position in EMBL accession number 
X5 1602). 519, 786, 1422, 1429 (each according to the position in EMBL accession number 
D64016). 454 (according to SEQ ID No. 3) and696 (according to SEQ ID No. 5), in the fit- 1 
10 gene. 

According to another aspect of the present in\'ention there is pro\'ided a 
pharmaceutical pack comprising an llt-l ligand antagonist drug and instructions for 
administration of the drug to humans diagnostically tested for a single nucleotide 
polyiuorphism iu one or more of positions: 1 953, 3453, 3888 (each according to llie position 

15 in EMBL accession number X5 1602), 5 1 9, 786, 1 422, 1 429 (each according to the poshion in 
EMBL accession number D64016), 454 (according to SEQ ID No. 3) and 696 (according to 
SEQ ID No. 5), in the fit- 1 gene. 

According to another aspect of the invention there is provided an isolated nucleic acid 
sequenceeomprising the seq|ience selected from the group consisting of: 

20 (i) the nucleotide sequence froni positions 1-482 of SEQ ID No, 1 ; 

(ii) the nucleotide sequence from positions 616-1073 of SEQ ID No. I: 

(iii) the nucleotide sequence from positions 1-437 of SEQ ID No. 2; 

(iv) the nucleotide sequence from positions 595-1024 of SEQ ID No. 2; 

(v) the nucleotide sequence from positions 1123-1480 of SEQ ID No. 2; 
25 (vi) the nucleotide sequence from positions 1-266 of SEQ ID No. 3; 

(vii) the nucleotide sequence from positions 279-726 of SEQ ID No. 3; 

( viii) the nucleotide sequence from positions 1-284 of SEQ ID No. 4; 

(ix) the nucleotide sequence from positions 391-651 of SEQ ID No. 4; 

(x) the nucleotide sequence from positions 795-1352 of SEQ ID No. 4; 
30 (xi) the nucleotide sequence from, positions 1-579 of SEQ ID No. 5; 

(xii) the nucleotide sequence from positions 665-1256 of SEQ ID No. 5; 
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(xiii) a nucleotide sequence having at least 80%, preferably at least 90%, sequence identity 
to a sequences (i) - (xii); 

(xiv) an isolated fragment of (i) - (xiii); and 

(xv) aniucleotide sequence fully complementary to (i) - (xiv). 

5 In the above, group (xiii) relates to variants of the polynucleotide depicted in groups 

( i) - (xii). The variant of the polynucleotide may be a naturally occurring allelic variant, from 
the same species or a different species, or a non-naturally occurring allelic variant. As kuovvn 
in the art an allelic variant is an alternate form of a polynucleotide sequence which may have a 
deletion, addition or substitution of one or more nucleotides. 

10 Sequence identity can be assessed by best-fit computer alignment analysis using 

suitable software such as Blast, Blast2, FastA, Fasta3 and PILEUP. Preferred software for use 
in assessing the percent identity, i.e how two polynucleotide sequences line up is PILEUP, 
Identity refers to direct matches. In the context of the present invention, two polynucleotide 
sequences with 90% identity have 90%) of the nucleotides being identical and in a like 

15 position vvhen aligned optimally allowing for up to 10, preferably up to 5 gaps. The present 
invention particularly relates to poiynucleotides vvhich hybridise to one or other of the 
polynucleotide sequences (i) - (xv), under stringent conditions. As used herein, stringeift 
conditions are those conditions which enable sequences that possess at least 80%), preferably 
at least 90%, more preferably at least 95%) and more preterably at least 98% sequence identity 

20 to hybridise together. Thus, nucleic acids which can hybridise to one or other of the nucleic 
acids of (I)> (xv), include nucleic acids which have at least 80%, preferably at least 90%, 
more preferably at least 95%, even more preferably at least 98% sequence identity and most 
preferably 100%o, over at least a portion (at least 20, preferably 30 or more consecutive 
nucleotides) of the polynucleotide sequence of (i) - (xv) above. 

25 As well as the novel intron sequences depicted in SEQ ID Nos. 1 - 5, smaller nucleic 

acid fragments thereof useful for example as oligonucleotide primers to amplify the flt-1 gene 
sequences or identify SNPs using any of the well known amplification systems such as the 
polymerase chain reaction (PGR), or fragments that can be used as diagnostic probes to 
identify corresponding nucleic acid sequences are also part of this invention. The invention 

30 thus includes polynucleotides of shorter; length than the novel intron flt-1 sequences depicted 
in SEQ ID Nos. 1 - 5 that are capable of specifically hybridising to the sequences depicted 
herein. Such polynucleotides tnay be at least 17 nucleotides in length, preferably at least 20, 
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niore preferably at least 30 nucleotides in length and may be of any size up to and including or 
indeed, comprising the complete intron sequences depicted in SEQ ID Nos. 1 - 5. 

An example of a suitable hybridisation solution when a nucleic acid is immobilised on 
a nylon membrane and the probe nucleic acid is greater than 300 bases or base pairs, say 500 
5 bp, is: 6 x SSC (saline sodium citrate), 0.5% SDS (sodium dodecyl sulphate), lOOug/ml 
denatured, sonicated salmon sperm DNA, An example of a suitable hybridisation solution 
when a nucleic acid is immobilised on a nylon membrane and the probe is an oligonucleotide 
of between 12 and 50 bases is: 3M trimethylammonium chloride (TMACl), 0.01 iVl sodium 
phosphate (pH 6.8), ImiVI EDTA (pH 7.6), 0.5% SDS, 100|.ig/ml denatured, sonicated salmon 

10 sperm DNA and 0.1% dried skimmed milk. The hybridisation can be performed at 68''C tor 
at least 1 hour and the filters then washed at 68°C in I x SSC, or for higher stringency, 0,1 x 
SSCVO.1% SDS. l lybridisation techniques are W'ell advanced in the art. The person skilled in 
the art wil l be able to adapt the hybridisation conditions to ensure hybridisatibn of sequences 
with 80%, 90% or more identity. 

15 A fragment can be any part of the full length sequence anci niay be single or double 

stranded or may comprise both single and double stranded regions. In a preferred 
embodiment; a fragment i.s a restriction enzyme fragment. 

The nucleic acid sequences of the invention, particularly those relating to and 
identifying the single nucleotide polymorphisms identi tied he|ein represent a valuable 

20 information source with which to identity further sequences of similar identity and 

characterise individuals in terms of, for example, their identity, haplotype and other sub- 
groupings, such as susceptibility to treatment with particular drugs. These approaches are 
most easily facilitated by storing the sequence information in a computer readable medium 
and then using the information in standard macromolecular structure programs or to search 

25 sequence databases using state of the art searching tools such as GCG (Genetics Computer 
Group), BlastX BlastP, BlastN, FASTA (refer to Altschul et al. J. Mol. Biol, 215:403-4 10, 
1 990). Thus, the nucleic acid sequences of the invention are particularly useful as 
components in databases useful for sequence identity, genome mapping, pharmacogenetics 
and other search analyses. Generally, the sequence information relating to the nucleic acid 

30 sequences and polymorphisms of the invention may be reduced to, converted into or stored in 
a tangible medium, such as a computer disk, preferably in a computer readable form. For 
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exaniple, chromatographic scan data or peak data, photographic scan or peak data, mass 
spectrographic data, sequence gel (or other) data. 

The invention provides a computer readable medium having stored thereon one or 
more nucleic acid sequences of the invention. For example, a computer readable medium is 
provided comprising and having stored thereon a member selected from the group consisting 
of; a nucleic acid comprising the sequence of a nucleic acid ofthe invention, a nucleic acid 
consisting of a nucleic acid of the invention, a nucleic acid vvhich comprises part of a nucleic 
acid ofthe invention, which part includes at least one ofthe polymorphisms ofthe invention, 
a set of nucleic acid sequences wherein the set includes at least one nucleic acid sequence of 
the invention, a data set comprising or consisting of a nucleic acid sequence ofthe invention 
or a part thereof comprising at least one of the polymorphisms identified herein. The 
computer readable medium can be any composition of matter used to store information or 
data, including, for example,, floppy disks, tapes, chips, compact disks, digital disks, video 
disks, punch cards and hard drives. 

In another cispect of the invention there is provided a computer readable medium 
having stored thereon a nucleic acid sequence comprising at least 20 consecutive bases of the 
uPAR gene sequence, which sequence includes at least one of the polymorphisms at positions 
1 953, 3453, 3888 (each according to the position in EMBL accession number X5 1602), 519, 
786, 1422, 1429 (each according to the position in EMBL accession number D640 16), 454 
(according to SEQ ID No. 3) and 696 (according to SE^ 

In another aspect of the invention there is provided a computer readable medium 
having stored thereon a nucleic acid coniprising any ofthe intron sequences disclosed in any 
of SEQ IDNos. 1 - 5. 

A computer based method is also provided for performing sequence identification, 
said method comprising the steps of providing a nucleic acid sequence coniprising a 
polymorphism of the invention in a computer readable medium; and comparing said 
polymorphism containing nucleic acid sequence to at least one other nucleic acid or 
polypeptide sequence to identify identity (homology), i.e. screen for the presence of a 
polymorphism. Such a method is particularly useful in pharmacogenetic studies and in 
genome mapping studies. 

In another aspect of the invention there is provided a method for performing sequence 
identification, said method comprising the steps of providing a nucleic acid sequence 
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comprising at least 20 consecutive bases of' the mPAR gene sequence, which sequence 
includes at least one of the polymorphisms at positions: 1953, 3453, 3<S88 (each according to 
the position in EMBL accession number X5 1602)/5 19, 786, 1422, 1429 (each according to 
the position in EMBL accession number D64016), 454 (according to SEQ ID No. 3) and 696 
5 (according to SEQ ID No. 5) in a computer readable medium; and comparing said nucleic 
acid sequence to at [east one other nucleic acid sequence to identify identity. 

In another aspect of the invention there is provided a method for performing sequence 
identification, said method comprising the steps of providing one or more of the following 
polymorphism containing nucleic acid sequences: 
10 the nucleic acid disclosed in EMBL Accession Number X5 1602 with A at position 1953 
according to the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL Accession Number X51602 with Tat position 
3453 according to, the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL Accession Number X5 1602 with C at position 
15 3888 according to die nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL AccessionNumber D640 1 6 with T at position 
519 according to the nucleotide positioning therein; 

the nucleic acid sequence disclosed in EMBL Accession Number D640 1 6 with T at position 
786 according to the nucleotide positioning therein; | 
20 the nucleic acid sequence disclosed in EiVIBL Accession Number D640 16 with T at position 
1422 according to the nucleotide positioning therein; : 

the nucleic acid sequence disclosed in EMBL Accession Number D640 16 with T at position 
1429 according to the niicleotide positioning therein;^ 

the nucleic acid sequence disclosed in SEQ ID No. 3 with G at position 454 according to the 
25 nucleotide positioning therein; 

the nucleic acid sequence disclosed in SEQ ID No. 3 with A at position 454 according to the 
nucleotide positioning therein; 

the nucleic acid sequence disclosed in SEQ ID No. 5 withT at position 696 according to the 
nucleotide positioning therein; 
30 the nucleic acid sequence disclosed in SEQ ID No. 5 with C at position 696 according to the 
nucleotide positioning therein; 
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or a complementary strand thereof or a fragment thereof of at least 1 7 bases comprising at ' 
least one of the polymorphisms, and comparing said nucleic acid sequence to at least one 
other nucleic acid or polypeptide sequence to determine identity. 

The invention will now be illustrated but not limited by reference to the following 
5 Examples. All temperatures are in degrees Celsius. , 

In the Examples below, unless otherwise stated, the following methodology and 
n^aterials have been applied. 

AMPLrrAQ^^ available from Perkin-Elmer Getusjs used as the so 
thermostable DNA polymerase. 
10 General molecular biology procedures can be followed from any of the methods 

described in/^Molecular Cloning - A Laboratory Manuar' Second Edition, Sambrook, Fritsch 
and Maniatis (Cold Spring Harbor Laboratory, 1989). 

Electropherograms were obtained in a standard manner: data vvas collected by ABI3 77 
data collection so;llware and the wave forni generated by ABi 
l-5." -(2,1.2), ■ 

: . EXAMPLES 

Example I - Identification of Polvmornhisms 
. A. Methods 

20 In view of the large size of the fIt-1 gene, scanning f^ 

the fit- 1 gene. The suh-cloning strategy and primer positions used is disclosed below. 
DNA Preparation 

DNA was prepared from fr02:en blood samples collected in I 
(Molecular Cloning: A Laboratory M^^ 

25 Edition, Cold Spring Harbor Press, 1989) with the following modifications. The thawed 
blood was diluted in an equal volume of standard saline citrate instead of phosphate buffered 
saline to remove lysed red blood cells. Samples were extracted with phenol, then 
phenol/chloroform and then chloroform rather than with three; phenol extractions. The DNA 
was dissolved in deionised water. 

30 Template Preparation 

Templates were prepared by PGR using the oligonucleotide primers and annealing 
temperatures set out below. The extension temperature was 72° and denaturation temperature 
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Generally 50 ng of genomic DNA was used in each reaction and subjected to 35 c^'clos 
of PCR. 

Dye Primer Sequencing 

Dye-primer sequencing using M13 forward and reverse primers was as described in 
5 the ABl protocol P/N 4021 14 for the ABI Prisni^^^'^' dye primer cycle sequencing core kit with 
"AmpliTaq FS'^™ DNA polymerase, modified in that the annealing temperature was 45° and 
DiVlSO was added to the cycle sequencing mix to a final concentration of 5 %. 

The extension reactions for each base were pooled, ethanol/sodium acetate 
precipitated, vv^ished and resuspended in formamide loading buffer. 

4.25 % Acrylamide gels vvere run on an automated sequencer (ABl 377, Applied 
Biosystems). 

B. Results 
Primer design 
5 L Primer lociUions tor scan orCoding rcg! 

All locations in this section refer to EMBt Acc^^^ 
EMBL Accession Number X51602 , 7680 bp 
5' UTR (1-249), Coding (250-4266), 3'UTR (4267-7680) 
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Products requiring t\vo stage amplification from c DNA 






Primary product 








Product Fonvard Primer 


Revcrsc Primer 


Temp 


Time 


1777-3946 1777-1804 


3919-3946 


55o 


3 mill 


Secondary Products (Primary product diluted 1000 x) 






Product Forward Primer 


Reverse Primer 


Temp 


r i m c 


a. 1854-2435 1854-1877 


2412-2435 


58o 


90 sec 


b. 2288-^2879 2288-23 11 


2857-2879 


58o 


90 sec 


c. 272j-jj10 2723-2746 


j288-j310 


58o 


90 sec 


a. 3157-3748 3 157-3 180 


3725-3748 


58o 


^ ■ 90' sec 


Products amplified directly from cDiNA 








Product Korwaru rnmer 
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Temp 


Time 


e. 293-696 292-313 


673-696 


55o 


90 sec 


f. 564-1 133 564-587 


1110-1133 


55o ' 


; 90 sec 


g. 1031-1626 1031-1054 


1603-1626 


55o 


90 sec 


h. 1491-2046 1491-15 L4 


2023-2046 


55o 


90 sec: 


i. 3662-4249 3662-3682 


4226-4249 


35o.. 1 


90 sec 


Products amplified frorn genomic DNA 








Product Forward Primer 


Reverse Primer 


Temp 


Time 


j..4;l 63^-4744 4163-4182 


4721-4744 , 


55o 


90 sec 


2. Primer locations tor scan of promoter, 5' UTR, exon 1 






All locations in this section refer to EMBL Accession Number D640 16 


EMBL Accession Number D64016, 1745 bp 






Promoter region, exon 1, intron l 








Product Forward Primer 


Reverse Primer 


Temp 


Time 


Ic. 14-479 14-34 


456-479 


55g 


90 sec 



00-004 



1. 343-890 
m. 762-1251 
n. 115 1-1694 



343-366 
762-78 1 
1 151-1 172 



-25- 

869-890 

1232-1251 

1673-1694 



55o 
55o 



90 sec 
90 sec 
90 sec 



3 For dye-primer sequencing these primers were modified to include the M 1 3 forward 

and reverse primer sequences (ABI protocol P.'N 4021 14, Applied Biosystems) at the 5" end 
of the forward and reverse oligonucleotides respectively. 

Novel Polymorphisms 
10 Novel Polymorphisms within coding region - numbering refers to EMBL Accession 
Number X5 1602 

IN)sition Polyrnorpliism /\.llelc Frequency No of Individuals 

1953 G/A G90%A10% 31 

1:5 ■■. . . ■ 

Polymoiphisni at position 1953 alters th 568 (Threonine - 

ACO/ACA), [t has: been shown that single nucleotide polyniorphisms ca^i^^^ 
structural folds of mRNA with potentially different biological fLinctions (Shea et al i 999. 
//?/c/).11ie pol|morphisiii can be detected by a diagnostic e RFLP since engineering of 
20 positions 1 949,1950 creates a BsiWI recognition sequence (GGTAGG). Polymorphism at 
position i 953 will modify the recoirnition sequence (GGTAC G/A 1. 

Diagnostic primer (positions 1919-1952 in X51602) 

ATGGGTTTCATGTTAACTTGGAAAAAATGGGTAC 
25 modified residues in bold underline 

Reverse primer (positions 2098-2125 in X51602) 

G ATTG ATG ATGGT A AG ATT A AG AGTG AT 

Amplification of genomic DNA with these primers will generate a PGR product of 
30 206 bp. Digestion of a product from a wild type template with Bsi WI (New England 

B iolabs) will give rise to products of I 68bp and 3 8 bp. Digestion of a heterozygote product 
will geaerate products of 206 bp, 168 bp and 38 bp. A product generated from a homozygote 
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variant will not be digested by BsiWL Products can be separated and visualised on agarose 
gels following standard procedures (i.e. iVIolecular Cloning: Sanibrook et al.. 1989, ihicf). 



(2) 

5 Position Polymorphism Allele Frequency iVo of Individuals 

3453 C/T C70% T 30% 23 

Polymorphism at position 3453 alters the third base oFcodon 1068 (Proline - 
CCC/CCT), It has been shown that single nucleotide polymorphisms can cause different 
1(3 structural folds of mRNA with potentially different biological functions (Shen et al 1999, 
//; /<:/). The polymorphism at position 3453 can be detected by a diagnostic e RFLP, since 
niodiricacion of positions 3455. 3456, 3457 creates a PstI recognition sequence (CTGGAG). 
Polymorphism at position 3453 vvill modify the recognition sequence (GTGCA/TG). 

15 Diagnostic primer (Reverse^ positions 3487-3454 in X51602, equivalent to positions 330- 

^' 297 in Seq XD No:3).. ■ : ^ 

TGTTGGTTGGTGTAGATTTTGTCAAAGATAGCTGC 

tViodified residues in bold underline 

Forward primer (position 193-216 in Seq II) No 3) 

20 AGCCCATGGACACTCGGGTTGA 

Amplificatipn of genomic DNA with these primers will generate a PGR product of 
137 bp. A product generated from a wild type template will not be digested by Pstf (New 
England Biolabs). Digestion of a heterozygote product will give rise to products of 137 bp, 
25 102 bp and 3 5 bp, digestion of a homozygous product will give rise to products of 102 bp and 
35 bp. Products can be separated and visualised on agarose gels following standard 
procedures (i.e. Molecular Gloning: Sambrook et al;, 1989^/6/^ 
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(3) 

Position Polymorphism Allele Frequency No of [ndividuals 

3888 T/C T74% C 26 % 23 



5 Polymorphism at position 3888 alters the third base of codon 1213 (Tyrosine - 

TAT/TAC). It has been shown that single nucleotide polymorphisms can cause different 
structural folds of mRNA with potentially difl^rent biological functions (Shen et al 1999, 
ibici). Polymorphism at position 3888 creates a Snal B recognition sequence (TACGTA). 

10 Forward primer (Positions 362-385 in Seq ID No 5) 

CCTGAACCCTACAGAATGTGAATTG - 
Reverse primer (Positions 828-804 in Seq ID No 5) 

CAGGTAGGTGTAGTTGTCAGTCeTC 

15 Amphfication of genomic DNA with these primers will generate a PGR product of 

467 bp- A product generated from a wild type template will not be digested by SnalB (New 
England Biolabs). Digestion;of a heterozygote produc 

245 bp and 222 bp, digestion of a Homozygous variant product wil l generate products of 245 
bp and 222 bp. Products can be separated and- visualisedjpn agarose gels following standard 
20 procedure^ (i.e., Molecular Cloning: Sambrook 

iNovel polymorphisms within promoter and 5' UTR -numbering refers t6 EMBL 
Accession Number D64016 

Allele Frequency No of Individuals 

C97% T 3 % . ^ 34 

The polymorphism at position 5 19 can be detected by a diagnostic e RFLP, since 
30 modification of position 5 1 6 creates a potentiar SphI recognition sequence (GCATGC); 
Polymorphism atpositiori 519 will modity 



25 (4) 

Position 

519 



Pplymorphism 

C/T 
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Diagnostic primer (Positions 485-518 in D64016) 
GGGTGCATCAATGCGGCCGAAAAAGACACGGCA 
Modified residues in bold underline 
Constant primer (Positions 724-741 in D64016) 
5 GTGTTCTTGGCACGGAGG 

Amplification of genomic DNA with these primers will generate a PGR product of 
256 bp. A product generated from a wild type template will not be digested by SphI (New 
England Biolabs). Digestion of a heterozygote product will generate products of 256 bp, 221 
10 bp and 35 bp, digestion of a homozygotc variant producl vvill generate products of 221 bp and 
35 bp. Products can be separated and visualised on agarose gels following standard 
procedures (i.e. Molecular Cloning: Sambrook et al., 1989, /A/^^ 

.(5) 

\5 Position Polymorphism Allele Frequency No of Individuals 

786 C/T C 98% T2% 50 

The polymorphism at position 786 can be detected by a diagnostics RFLP. since 
niodillcation of position 781,782 creates a Narl recognition sequence (GGCGCC). 
20 Polymorphism at position 786 will modify the recognition sequence (GGGGCC/T). 

Diagnostic prinier (Positions 751-785 in D64016) 
GGCGCGGCCAGCTTGCCTTGGATCGGACTTGGCGC 
Modified residues in bold underHne 
25 Constant primer (Positions 869-890 in D640 16) 

Amplification of genomic DNA with these products will generate a PGR product of 
139 bp. Digestion of a product from a wild type template with Narl (New England ^ 
will generate products of 105 bp and 34 bp. Digestion of a heterozygote product will generate 
30 products of 139 bp, 105 bp and 34 bp. The homozygdus variant product will not be digested 
by Narl. Products can be separated and yisualised^ 
procedures (i.e. Molecular Cloning: Sambrook et ah, 1989, ibid). 
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(6) 

Position Polymorphism Allele Frequency No of Individuals 

1422 C/T C98% T 2% 25 

5 

Polymorphism at position 1422 alters an EagI recognition sequence (CGGC/TCG). 

Forward Primer (Positions 1251-1272 in D64016) 
Reverse primer (Positions 1673-1694 in D64016) 

10 

Amplification of genomic DNA with these primers generates a PGR pmduct of 443 
bp. Digestion of product from a wild type template with Eag 1 (New England Biolabs) will 
generate products of 271 bp and 143 bp. Digestion of a heterozygote product will generate 
products of 443 bp, 271 bp and 143 bp. The homozygous variant product vvill not be cleaved 
1 5 by Eag L Products can be separated and visualised on agarose gels following standard 
procedures (i.e; Molecular Cloning: Sambrook et ah, 1989, /'^^^^^^ 

Position Polymorphism Allele Frequency No of Individuals | 

20 1429 G/T G76% T24% 25^.,,, 

The polymorphism at position 1429 cati be detected by a diagnostic e RFLP, since 
modification of position 1431 ,1432 creates a Hinc H recognition sequence (GTTGAC). 
Polyniorphism at position 1429 vvill modify the recognition sequence (G/^ 

25 , 

Diagnostic primer (Reverse, positions 1430-1463 in D64016) 

CTGCTCGCCCGGTGCCCGCGCTCCCGGCGGTTATV 
Modified bases in bold underline 

Constant primer (Forward, positions 1251-1272 in D64016) 



Amplification of genomic DNA with these primers will generate a PGR product of 
21 2 bp, ^Digestion of product trom a wild type ternpiate with Hinc II (New England Biolabs) 
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vvill generate products of 1 78 bp and 34 bp, digestion of a heterozygote product will give rise 
10 products of 212 bp. 178 bp and 34 bp. A homozygote variant product will not be digested 
by Hinc II. Products can be separated and visualised on agarose gels following standard 
procedures (i.e. Molecular Cloning: Sambrook et ai., 1 989, ihicf). 



Novel polymorphism identified in intron 24 



Primer locations for scan of intron 24, All locations in this section refer to Seq ID No 3. 



1 0 Product 

193-538 
Position 

454 



Forward Primer 

193-216 
Polymorphism 

G/A 



Reverse Primer Temp Time 

538-5 1 5 55o 90 sec 

Allele Frequency No of Individuals 

G 76% -A 24% 23 



1 5 [Novel polymorphism identified in intron 28 

Primer locations for scan of intron 28, All locations in this section refer to Seq ID No 5. 



1^'oduct 

362-828 
20 Position 

696 



Forward Primer 

362-385 
Polymorphism 

T/C 



Reverse Primer 

828-804 
Allele Frequency 

T 76% C 24% 



Temp Time 

55o 90 sec 

No of Individuals 

23 



Novel genomic sequence flanking exons within the human flt-l gene 

25 Two overlapping B AC clones were isolated - 5 1L6 (5') and 87P12 (3^) 
Sequencing Primers (positions refer to Accession X 51602) 

Exon 17 (BAG clone 87P12) 

30 Forward 2641-2664 
Reverse 2664-2641 
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^ Exon 21 (BAG clone 87P12) 

Forward 1357-1380 
Reverse 1380-1357 

5 Exon 24 (BAG clone 87P12) 

1-orward 3452-3478 
Reverse 3529-3506 

Exon 27 (BAG clone 87P12) 

1(3 Forward 3785-381 1 
Reverse 381 1-3785 

Exon28 (BAGGlone87P12) 

Forward 3918-3946 
15 Reverse 3946-3918 



Application Mo: , 
Pillsbufv iVJadison a- Sutro 
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